Scheduling and Energy Efficiency Improvement Techniques for Hadoop Map-reduce: State of Art and Directions for Future Research

نویسندگان

  • Nidhi Tiwari
  • Umesh Bellur
چکیده

MapReduce has become ubiquitous for processing large data volume jobs. As the number and variety of jobs to be executed across heterogeneous clusters are increasing, so is the complexity of scheduling them efficiently to meet required objectives of performance. This report presents a survey of some of the MapReduce scheduling algorithms proposed for such complex scenarios. A taxonomy is provided for Map-reduce algorithms based on their runtime nature. The algorithms proposed for each hierarchical level of MapReduce scheduling are described in detail. Some pointers for future research to further improve the scheduling techniques are provided. Another aspect of MapReduce is that the size of their clusters is usually in hundreds and thousands, while it is used for processing infrequent batch and interactive jobs in parallel across these machines. Thus there is a need to look at energy efficiency of MapReduce clusters. A survey of some of the techniques proposed to improve MapReduce energy efficiency is done. The studied techniques have been classified based upon the MapReduce component they work-on. Details of techniques in each category are provided. Few suggestions for future research are given based on the gaps observed in these works.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive Dynamic Data Placement Algorithm for Hadoop in Heterogeneous Environments

Hadoop MapReduce framework is an important distributed processing model for large-scale data intensive applications. The current Hadoop and the existing Hadoop distributed file system’s rack-aware data placement strategy in MapReduce in the homogeneous Hadoop cluster assume that each node in a cluster has the same computing capacity and a same workload is assigned to each node. Default Hadoop d...

متن کامل

Beyond Hadoop: Recent Directions in Data Computing for Internet Services

As a main subfield of cloud computing applications, internet services require large-scale data computing. Their workloads can be divided into two classes: customer-facing query-processing interactive tasks that serve hundreds of millions of users within a short response time and backend data analysis batch tasks that involve petabytes of data. Hadoop, an open source software suite, is used by m...

متن کامل

Critical Path Method for Flexible Job Shop Scheduling Problem with Preemption

This paper addressed a Flexible Job shop Scheduling Problem (FJSP) with the objective of minimization of maximum completion time (Cmax) which job splitting or lot streaming is allowed. Lot streaming is an important technique that has been used widely to reduce completion time of a production system. Due to the complexity of the problem; exact optimization techniques such as branch and bound alg...

متن کامل

Energy Conservation in Building

The building sector accumulates approximately a third of the final energy consumption. Consequently, the improvement of the energy efficiency in buildings has become an essential instrument in the energy policies to ensure the energy supply in the mid to long term moreover is the most cost-effective strategy available for reducing carbon dioxide emissions This paper is studying the main objecti...

متن کامل

Performance of Building Energy Efficiency by Orientation with Regression: a Case of Semi Desert in Iran

In this research multiple-regression analysis with stepwise selection method was employed for investigating the effectof vertical building envelopes solar radiation (Evr) on cooling energy consumption (E cooling) in residential sector.The high capacity of solar energy in semi-arid climate (Shiraz) can provide a part of buildings required energy. Dependson house orientations in two directions of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012